Automated Audio-to-Text Transcription in Multi-Device Teleconferences

ABSTRACT

A system and method are disclosed for generating a teleconference space for two or more communication devices using a computer coupled with a database and comprising a processor and memory. The computer generates a teleconference space and transmits requests to join the teleconference space to the two or more communication devices. The computer stores in memory identification information, and audiovisual data associated with one or more users, for each of the two or more communication devices. The computer stores audio transcription data, transmitted to the computer by each of the two or more communication devices and associated with one or more communication device users, in the computer memory. The computer merges the audio transcription data from each of the two or more communication devices into a master audio transcript, and transmits the master audio transcript to each of the two or more communication devices.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No.17/739,868, filed on May 9, 2022, entitled “Automated Audio-to-TextTranscription in Multi-Device Teleconferences,” which is a continuationof U.S. patent application Ser. No. 16/861,929, filed on Apr. 29, 2020,entitled “Automated Audio-to-Text Transcription in Multi-DeviceTeleconferences,” now U.S. Pat. No. 11,328,730, which claims the benefitunder 35 U.S.C. § 119(e) to U.S. Provisional Application No. 62/876,401,filed Jul. 19, 2019, and entitled “Automated Audio-to-Text Transcriptionin Multi-Device Teleconferences.” U.S. patent application Ser. No.17/739,868, U.S. Pat. No. 11,328,730, and U.S. Provisional ApplicationNo. 62/876,401 are assigned to the assignee of the present application.

TECHNICAL FIELD

The present disclosure relates generally to electronic teleconferencesystems and more specifically to systems and methods to generate andtransmit audio-to-text transcripts in multiple-device teleconferencesautomatically.

BACKGROUND

Teleconference systems may utilize communication networks, including butnot limited to the internet, to connect communication devices such ascomputers, tablet computers, and/or smartphones. Teleconference systemsmay permit communication devices to share visual imagery and audio dataassociated with a speaking user with other communication devices.However, multi-device and multi-user teleconferences face severalchallenges in automatically generating and storing transcripts of theteleconference. By way of example and not of limitation, teleconferencesystems may be unable to rectify discrepancies in transcripts generatedby separate communication devices, and may also fail to make accessiblea master text transcript of the teleconference quickly and easily.

SUMMARY

The following presents a simplified summary of the disclosure in orderto provide a basic understanding to the reader. This summary is not anextensive overview of the disclosure and it does not identifykey/critical elements of the invention or delineate the scope of theinvention. Its sole purpose is to present some concepts disclosed hereinin a simplified form as a prelude to more detailed descriptionspresented below.

In embodiments of the disclosed subject matter, the unique systems andmethods described herein make use of an exemplary system and method toautomate audio-to-text transcription in multi-device teleconferences.Embodiments of the disclosed subject matter include two or morecommunication devices, including but not limited to tablet computers orsmartphones, and a computer coupled with a database and comprising aprocessor and memory. The computer generates a teleconference space andtransmits requests to join the teleconference space to the two or morecommunication devices. The computer stores in memory identificationinformation and audiovisual data transmitted by each of the two or morecommunication devices.

In embodiments of the disclosed subject matter, the computer merges theaudio transcription data from each of the two or more communicationdevices into a master audio transcript. The computer transmits themaster audio transcript to each of the two or more communicationdevices.

These and other features of the disclosed subject matter are describedin greater detail below.

BRIEF DESCRIPTION OF THE DRAWINGS

A more complete understanding of the present invention may be derived byreferring to the detailed description when considered in connection withthe following illustrative figures. In the figures, like referencenumbers refer to like elements or acts throughout the figures.

FIG. 1 illustrates an exemplary teleconference system, according to afirst embodiment;

FIG. 2 illustrates an exemplary communication device of FIG. 1 ingreater detail, according to an embodiment;

FIG. 3 illustrates the cloud system of FIG. 1 in greater detail,according to an embodiment;

FIG. 4 illustrates an exemplary method of generating an audio-to-texttranscript in a teleconference space comprising multiple communicationdevices, according to an embodiment;

FIG. 5 illustrates a teleconference view, according to an embodiment;and

FIG. 6 illustrates a transcript view, according to an embodiment.

DETAILED DESCRIPTION

Aspects and applications of the invention presented herein are describedbelow in the drawings and detailed description of the invention. Unlessspecifically noted, it is intended that the words and phrases in thespecification and the claims be given their plain, ordinary, andaccustomed meaning to those of ordinary skill in the applicable arts.

In the following description, and for the purposes of explanation,numerous specific details are set forth in order to provide a thoroughunderstanding of the various aspects of the invention. It will beunderstood, however, by those skilled in the relevant arts, that thepresent invention may be practiced without these specific details. Inother instances, known structures and devices are shown or discussedmore generally in order to avoid obscuring the invention. In many cases,a description of the operation is sufficient to enable one to implementthe various forms of the invention, particularly when the operation isto be implemented in software. It should be noted that there are manydifferent and alternative configurations, devices and technologies towhich the disclosed inventions may be applied. The full scope of theinventions is not limited to the examples that are described below.

As described more fully below, embodiments of the following disclosurerelate to a teleconference system and method to create, transmit, merge,and store audio-to-text transcripts in multi-device teleconferences.Embodiments of the following disclosure generate a teleconference spacebetween two or more communication devices, each of which may be operatedby an individual or group of users. Each communication device recordsvisual and/or audio data of the one or more users associated with eachcommunication device, and transmits the visual and/or audio data toother communication devices participating in the teleconference space.Each communication device also generates a local text transcript of theaudio data of the one or more users associated with each communicationdevice, and transmits the local text transcript to a cloud system. Thecloud system merges the local text transcripts from each communicationdevice into a single master text transcript. The cloud system transmitsthe master text transcript to each communication device.

Embodiments of the following disclosure generate text transcripts ofaudio and/or audiovisual teleconferences automatically and in real time.Embodiments permit users of communication devices to view the mastertext transcript in real time as the teleconference progresses.Embodiments store the master text transcript on communication devicesand/or the cloud system, and allow communication devices to access themaster text transcript after the teleconference concludes. Embodimentsprovide a consensus mechanism for merging a plurality of local texttranscripts, which may comprise differing text transcriptions of thesame audio data communications between communication devices, into asingle master text transcript.

FIG. 1 illustrates exemplary teleconference system 100, according to afirst embodiment. Teleconference system 100 comprises two or morecommunication devices 110, cloud system 120, network 130, andcommunication links 140-142. Although communication devices 110 a-110 n,single cloud system 120, single network 130, and communication links140-142 are illustrated and described, embodiments contemplate anynumber of communication devices 110, cloud systems 120, networks 130, orcommunication links 140-142, according to particular needs.

According to embodiments, each of two or more communication devices 110comprises processor 112 and memory 114. Processor 112 may execute anoperating system program stored in memory 114 to control the overalloperation of communication device 110. For example, processor 112 maycontrol the reception of signals and the transmission of signals withinteleconference system 100. Processor 112 may execute other processes andprograms resident in memory 114, such as, for example, registration,identification or communication over communication links 140-142, andmoving data into or out of memory 114, as required by an executingprocess.

According to embodiments, one or more users may be associated with eachof one or more communication devices 110. Each of the one or more usersmay comprise, for example, an individual person or customer, one or moreemployees or teams of employees within a business, or any otherindividual, person, group of persons, business, or enterprise whichcommunicates or otherwise interacts with one or more other communicationdevices 110. Although an exemplary number of communication devices 110are shown and described, embodiments contemplate any number ofcommunication devices 110 interacting with network 130 and one or morecloud systems 120 according to particular needs.

Each communication device 110 may comprise, for example, a cellularphone or smartphone, desktop computer, laptop computer, notebookcomputer, tablet-type device, terminal, or any other communicationdevice 110 capable of receiving, transmitting, and displayingaudiovisual information through network 130. In an embodiment, eachcommunication device 110 comprises an audiovisual recording device, suchas a camera and microphone, and an audiovisual display, such as anelectronic display screen and one or more speakers. Communication device110 audiovisual recording device may record audiovisual data, comprisingvisual data and/or audio data, of one or more users associated withcommunication device 110. Communication device 110 may transmit theaudiovisual data to other communication devices 110 participating in theteleconference space, and may receive inbound audiovisual data 218transmitted by other communication devices 110 participating in theteleconference space, over network 130 and communication links 140-142.Communication device 110 may display this inbound audiovisual data 218using the audiovisual display, as described in greater detail below.

In one embodiment, cloud system 120 comprises administrator 122 anddatabase 124. Administrator 122 is programmed to merge a plurality ofcommunication device 110 text transcripts, transmitted to administrator122 by each communication device 110, into a single master texttranscript. Database 124 comprises one or more databases 124 or otherdata storage arrangements at one or more locations local to, or remotefrom, cloud system 120. In one embodiment, the one or more databases 124is coupled with administrator 122 using one or more local area networks(LANs), metropolitan area networks (MANs), wide area networks (WANs), oraforementioned network 130, such as, for example, the Internet, or anyother appropriate wire line, wireless link, or any other communicationlinks 140-142. One or more databases 124 stores data that is madeavailable to and may be used by administrator 122 according to theoperation of teleconference system 100 described below. According toembodiments, administrator 122 hosts and runs one or more runtimeprocesses associated with cloud system 120.

Each of two or more communication devices 110 may be coupled with othercommunication devices 110, as well as cloud system 120, by network 130via communication link 140. Although communication links 140 a-140 n areshown connecting each of communication devices 110 a-110 n,respectively, to network 130, embodiments contemplate any number ofcommunication links 140-142 connecting any number of communicationdevices 110 with network 130, according to particular needs. In otherembodiments, communication links 140-142 may connect two or morecommunication devices 110 directly to one another, and/or directly tocloud system 120.

According to embodiments, communication links 140-142 couplecommunication devices 110, including processor 112 and memory 114 ofeach communication device 110, and cloud system 120, includingadministrator 122 and database 124, with network 130. Each communicationlink may comprise any wireline, wireless, or other link suitable tosupport data communications between two or more communication devices110 and/or cloud system 120 and network 130. Although communicationlinks 140-142 are shown as generally coupling two or more communicationdevices 110 and cloud system 120 with network 130, two or morecommunication devices 110 and cloud system 120 may communicate directlywith each other according to particular needs.

According to embodiments, network 130 includes the Internet, telephonelines, any appropriate LANs, MANs, or WANs, and any other communicationnetwork coupling two or more communication devices 110 and cloud system120. For example, cloud system 120 may maintain data at one or morelocations external to cloud system 120, and may make the data availableto two or more communication devices 110 using network 130, or in anyother appropriate manner.

According to embodiments, two or more communication devices 110 and/orcloud system 120 may each operate on one or more computers that areintegral to or separate from the hardware and/or software that supportsteleconference system 100. As used herein, the computer, the term“computer,” and “computer system” comprise an input device and an outputdevice. The computer input device includes any suitable input device,such as a keypad, mouse, touch screen, microphone, or other device toinput information. The computer output device comprises any suitableoutput device that may convey information associated with the operationof teleconference system 100, including digital or analog data, visualinformation, or audio information. Furthermore, the one or morecomputers include any suitable fixed or removable non-transitorycomputer-readable storage media, such as magnetic computer disks,CD-ROMs, or other suitable media to receive output from and provideinput to teleconference system 100. The one or more computers alsoinclude one or more processors 112 and associated memory 114 to executeinstructions and manipulate information according to the operation ofteleconference system 100.

Embodiments contemplate two or more communication devices 110 and cloudsystem 120 generating a teleconference space. Two or more communicationdevices 110 may connect to one another, and to cloud system 120, usingnetwork 130 and communication links 140-142, to generate and participatein the teleconference space. Each communication device 110 may transmitaudiovisual data, comprising visual and/or audio data recorded by eachcommunication device 110's audiovisual recording device, to one or moreother communication devices 110 participating in the teleconferencespace. Each communication device 110 may store, in communication device110's memory 114, inbound audiovisual data 218 communication device 110receives from other communication devices 110, and may display inboundaudiovisual data 218 from other communication devices 110 oncommunication device 110's audiovisual display in the form ofteleconference view 502, as illustrated in FIG. 5 .

According to embodiments, each communication device 110 generates, andstores in communication device 110 memory 114, a local text transcriptof the audio data recorded by communication device 110's audiovisualrecording device. Each communication device 110 transmits a local texttranscript to cloud system 120 using network 130 and communication links140-142. Cloud system 120 stores the local text transcripts from eachcommunication device 110 participating in the teleconference space indatabase 124. Cloud system 120 administrator 122 merges the local texttranscripts from other communication devices 110 into a single mastertext transcript, which cloud system 120 stores in database 124. Cloudsystem 120 administrator 122 transmits the master text transcript toeach communication device 110 participating in the teleconference space.

Each communication device 110 may store the master text transcript inmemory 114 of communication device 110. Each communication device 110may display the master text transcript on an audiovisual display in theform of transcript view 602, as illustrated in FIG. 6 .

FIG. 2 illustrates exemplary communication device 110 of FIG. 1 ingreater detail, according to an embodiment. As described above,communication device 110 may comprise processor 112 and memory 114.Although communication device 110 is described as comprising a singleprocessor 112 and memory 114, embodiments contemplate any suitablenumber of processors 112, memory 114, or other data storage andretrieval components internal to or externally coupled withcommunication device 110.

According to embodiments, communication device 110 processor 112 maycomprise administration module 202, audiovisual recording module 204,transcription module 206, and graphical user interface module 208.Although processor 112 is described as comprising single administrationmodule 202, audiovisual recording module 204, transcription module 206,and graphical user interface module 208, embodiments contemplate anysuitable number of administration modules 202, audiovisual recordingmodules 204, transcription modules 206, graphical user interface modules208, and/or other modules, internal to or externally coupled withcommunication device 110, according to particular needs.

Communication device 110 memory 114 may comprise communication devicesdata 210, local device visual data 212, local device audio data 214,local device text transcript data 216, inbound audiovisual data 218, andmaster text transcript data 220. Although memory 114 is described ascomprising communication devices data 210, local device visual data 212,local device audio data 214, local device text transcript data 216,inbound audiovisual data 218, and master text transcript data 220,embodiments contemplate any suitable number or combination ofcommunication devices data 210, local device visual data 212, localdevice audio data 214, local device text transcript data 216, inboundaudiovisual data 218, master text transcript data 220, and/or otherdata, internal to or externally coupled with communication device 110,according to particular needs.

Administration module 202 of processor 112 may configure, update, and/ormanage the operation of communication device 110. That is,administration module 202 may configure, update, and/or manage thebroader operation of communication device 110 and change which data isexecuted and/or stored on each communication device 110 and/or cloudsystem 120. Administration module 202 may comprise a user-configurablesystem, such that communication device 110 processor 112 may storecommunication devices data 210, local device visual data 212, localdevice audio data 214, local device text transcript data 216, inboundaudiovisual data 218, and/or master text transcript data 220 eithersingularly or redundantly in communication device 110 memory 114 and/orcloud system 120 database 124, according to particular needs. Accordingto embodiments, administration module 202 of processor 112 may transmitinformation to and receive information from other communication devices110 and/or cloud system 120 participating in the teleconference space,as described in greater detail below.

Administration module 202 may transmit information to and receiveinformation from other communication devices 110 using network 130 andcommunication links 140-142. Transmitted/received information maycomprise local device visual data 212, local device audio data 214, orany other type of data. In an embodiment, administration module 202 mayutilize web-based real-time communication (WebRTC) protocols to transmitinformation to and receive information from administration modules 202of other communication devices 110 participating in the teleconferencespace. Upon transmitting information to or receiving information fromanother communication device 110, administration module 202 may recordunique identifying information specific to other communication device110 in communication devices data 210 of memory 114.

Communication devices data 210 of memory 114 comprises identificationinformation for one or more communication devices 110, such as, forexample, names and addresses of the one or more users associated witheach of one or more communication devices 110, company contactinformation, telephone numbers, email addresses, IP addresses, and thelike. According to embodiments, communication devices data 210 may alsocomprise information associated with the operating systems of each ofone or more communication devices 110, internet browser informationregarding each of one or more communication devices 110, or systemspecifications (such as, for example, processor 112 speed, availablememory 114, hard drive space, and the like) for each of one or morecommunication devices 110.

Communication devices data 210 may also include end user ID information,communication device 110 ID information, communication device 110 MACaddress information, or any other type of information which may identifyand track each communication device 110 participating in teleconferencesystem 100. Although particular communication devices data 210 aredescribed, embodiments contemplate any type of communication devicesdata 210 associated with one or more communication devices 110,according to particular needs.

Audiovisual recording module 204 of processor 112 may be operativelyassociated with, and may monitor and facilitate the operation of, theaudiovisual recording device of communication device 110. By way of anexample only and not by way of limitation, audiovisual recording module204 may activate the audiovisual recording device of communicationdevice 110, and may record audiovisual data, comprising visual dataand/or audio data, regarding the one or more users associated withcommunication device 110. Audiovisual recording module 204 may use theaudiovisual recording device to record visual data, such as a video fileor real-time visual stream, or one or more individual image snapshots,of one or more users associated with communication device 110. Althoughparticular examples of visual data are described herein, embodimentscontemplate audiovisual recording module 204 using the audiovisualrecording device of communication device 110 to record any form ofvisual data, according to particular needs. Audiovisual recording module204 stores the visual data in local device visual data 212 of memory114.

Local device visual data 212 of memory 114 stores visual data recordedby audiovisual recording module 204. The visual data may comprise avideo file or real-time visual stream saved in any file format, or oneor more individual image snapshots saved in any file format, of one ormore users associated with communication device 110. Although particularexamples of visual data are described, embodiments contemplateaudiovisual recording module 204 storing any form of visual data inlocal device visual data 212, according to particular needs.

Audiovisual recording module 204 may also use communication device 110audiovisual recording device to record audio data of one or more usersassociated with communication device 110. The audio data may comprise,for example, a recorded audio file of one or more speaking users.Although particular audio data are described herein, embodimentscontemplate audiovisual recording module 204 using the audiovisualrecording device of communication device 110 to record any form of audiodata, according to particular needs. Audiovisual recording module 204stores the audio data in local device audio data 214 of memory 114.

Local device audio data 214 of memory 114 stores audio data recorded byaudiovisual recording module 204. The audio data may comprise any formof audio data stored in any file format, according to particular needs.

According to embodiments, transcription module 206 accesses local deviceaudio data 214 and transcribes the audio data stored in local deviceaudio data 214 into a local device text transcript. Transcription module206 may identify one or more speaking users whose voices are recorded inlocal device audio data 214, and may transcribe their spoken words intoa written local device text transcript, which transcription module 206stores in local device text transcript data 216. Transcription module206 may use one or more algorithms to parse local device audio data 214into one or more discrete words, including but not limited to deeplearning processes, artificial intelligence (AI) processes, and/orpredictive algorithms which estimate the next words in a user's sentencebased on the user's previously-spoken words or sentences. Transcriptionmodule 206 may use one or more algorithms to separate spoken words inlocal device audio data 214 from background noises orunintended/unspoken sounds in local device audio data 214. Transcriptionmodule 206 may also sort words into punctuated sentences, and mayinterpret pauses in spoken words as breaks between transcript sentences.In an embodiment, transcription module 206 may update the local devicetext transcript substantially in real time; in other embodiments,transcription module 206 may update the local device text transcriptonce every second, once every five seconds, or at any other frequency.Transcription module 206 may analyze the vocal pitch and/or soundfrequency of spoken words to associate each word with a particular userassociated with communication device 110 (such as, in an embodiment,associating words spoken in a lower pitch with a particular male user,and words spoken in a higher pitch with a particular female user), andmay store information associating each word with a particular user inlocal device text transcript data 216. Transcription module 206 may alsoassociate chronological and/or timestamp information with each word, andmay store the chronological and/or timestamp information in local devicetext transcript data 216.

Local device text transcript data 216 of memory 114 stores the localdevice text transcript generated by transcription module 206. Accordingto embodiments, transcription module 206 may store the local device texttranscript as a plain text file, an encrypted file, or in any otherformat capable of storing a text file comprising transcribed audio data.Transcription module 206 may also store chronological and/or timestampinformation associated with each transcribed word in local device texttranscript data 216.

In an embodiment, administration module 202 transmits the visual datastored in local device visual data 212 of communication device 110memory 114, and/or the audio data stored in local device audio data 214of communication device 110 memory 114, to each of other communicationdevices 110 participating in the teleconference space. Similarly, othercommunication devices 110 participating in the teleconference spacetransmit visual data and/or audio data from those communication devices110 to administration module 202. Administration module 202 mayassociate each of the inbound visual data and/or audio datatransmissions with the particular communication device 110 thattransmitted the inbound visual and/or audio data transmission, and maystore the identification and association information in communicationdevices data 210 of memory 114. Administration module 202 stores theinbound visual data and/or audio data from other communication devices110 in inbound audiovisual data 218 of communication device 110 memory114. In an embodiment, administration module 202 may transmit the visualdata and/or the audio data to each of other communication devices 110substantially in real time; in other embodiments, administration module202 may transmit the visual data and/or the audio data to each of othercommunication devices 110 once every second, once every five seconds, orat any other frequency.

Inbound audiovisual data 218 of communication device 110 memory 114stores the inbound visual data and/or audio data from one or more othercommunication devices 110 participating in the teleconference space. Inan embodiment, inbound audiovisual data 218 stores informationassociating each particular visual data and/or audio data transmissionwith communication device 110 that transmitted that particular visualdata and/or audio data.

According to embodiments, graphical user interface module 208 ofcommunication device 110 processor 112 generates a graphical userinterface display comprising and displaying features of thecommunication devices data 210, local device visual data 212, localdevice audio data 214, local device text transcript data 216, inboundaudiovisual data 218, and master text transcript data 220. In anembodiment, graphical user interface module 208 accesses local devicevisual data 212, local device audio data 214, and inbound audiovisualdata 218, and generates teleconference view 502, illustrated in FIG. 5 .Teleconference view 502 displays visual and/or audio data associatedwith one or more communication devices 110 participating in theteleconference space, as described in greater detail below. Graphicaluser interface module 208 may generate different graphical userinterface displays conveying different types of information about theteleconference space, teleconference view 502, and transcript view 602,according to embodiments.

According to embodiments, master text transcript data 220 ofcommunication device 110 memory 114 stores data pertaining to the mastertext transcript. Cloud system 120 generates the master text transcript,and transmits the master text transcript to administration module 202 ofcommunication device 110 using network 130 and communication links140-142. Administration module 202 stores the master text transcript inmaster text transcript data 220 of communication device 110 memory 114.According to embodiments, administration module 202 may store the mastertext transcript as a plain text file, an encrypted file, or in any otherformat capable of storing a text file comprising transcribed audio data.In an embodiment, graphical user interface module 208 accesses mastertext transcript data 220 and generates transcript view 602, illustratedin FIG. 6 .

FIG. 3 illustrates cloud system 120 of FIG. 1 in greater detail,according to an embodiment. As discussed above, cloud system 120 maycomprise one or more computers at one or more locations includingassociated input devices, output devices, non-transitorycomputer-readable storage media, processors 112, memory 114, or othercomponents to send and receive information between one or morecommunication devices 110 according to the operation of teleconferencesystem 100. In addition, cloud system 120 comprises administrator 122and database 124. Although cloud system 120 is described as comprising asingle administrator 122 and database 124, embodiments contemplate anysuitable number of administrators 122 or databases 124 internal to orexternally coupled with cloud system 120. In addition, or as analternative, cloud system 120 may be located internal to one or morecommunication devices 110. For example, in an embodiment in which one ofcommunication devices 110 is a computer, the computer may also functionas cloud system 120 and may generate the teleconference space for otherparticipating communication devices 110. In other embodiments, cloudsystem 120 may be located external to one or more communication devices110.

According to embodiments, administrator 122 comprises transmissionmodule 302 and merging module 304. Although a particular configurationof administrator 122 is shown and described, embodiments contemplate anysuitable number or combination of transmission modules 302, mergingmodules 304, and/or other modules, located at one or more locationslocal to, or remote from cloud system 120, according to particularneeds. In addition, or as an alternative, transmission module 302 andmerging module 304 may be located on multiple administrators 122 orcomputers at any location in teleconference system 100.

Database 124 comprises local device text transcript data 310 and mastertext transcript data 312. Although database 124 is shown and describedas comprising local device text transcript data 310 and master texttranscript data 312, embodiments contemplate any suitable number orcombination of local device text transcript data 310, master texttranscript data 312, and/or other data pertaining to teleconferencesystem 100 located at one or more locations, local to, or remote from,cloud system 120, according to particular needs.

According to embodiments, transmission module 302 of cloud system 120administrator 122 transmits information to, and receives informationfrom, administration modules 202 of each of two or more communicationdevices 110 participating in the teleconference space. Morespecifically, each communication device 110 administration module 202may transmit, to transmission module 302 of cloud system 120, localdevice text transcript data 216 generated by each particularcommunication device 110. Transmission module 302 may store each of theother local device text transcripts in local device text transcript data310 of cloud system 120 database 124.

Local device text transcript data 310 of cloud system 120 may store datapertaining to the local device text transcript of each communicationdevice 110. As described above, each communication device 110participating in the teleconference space generates a local device texttranscript, including chronological and/or timestamp informationassociated with each transcribed word, and transmits the local devicetext transcript to transmission module 302 of cloud system 120administrator 122. Transmission module 302 may also record, in localdevice text transcript data 310, unique identifying informationregarding each communication device 110 that transmitted each of thelocal device text transcripts.

According to embodiments, merging module 304 of cloud system 120administrator 122 accesses each of the local device text transcriptsstored in local device text transcript data 310. Merging module 304merges the local device text transcripts into a single master texttranscript. Merging module 304 may use the unique identifyinginformation regarding which communication device 110 transmitted each ofthe local device text transcripts, and/or the chronological and/ortimestamp information associated with each transcribed word, toassociate one or more users with each local device text transcript, andto assign dialogue text contained in each local device text transcriptto one or more communication devices 110 and/or one or more associatedusers at particular points in time. Merging module 304 may arrange thedialogue text from each communication device 110 local device texttranscripts into a single chronological master text transcriptcomprising all dialogue text from all local device text transcripts.Merging module 304 may update the master text transcript as transmissionmodule 302 stores updated local device text transcripts in local devicetext transcript data 310 of database 124. In an embodiment, mergingmodule 304 may update the master text transcript substantially in realtime; in other embodiments, merging module 304 may update the mastertext transcript once every second, once every five seconds, or at anyother frequency.

According to embodiments, merging module 304 uses one or more consensusmechanisms to rectify discrepancies between the local device texttranscripts and to merge the local device text transcripts into a singlemaster text transcript. For example, in an embodiment, merging module304 may utilize a majority consensus mechanism to merge the local devicetext transcripts into a single master text transcript. In this example,merging module 304 implementing the majority consensus mechanismpreferentially chooses dialogue text present in the majority of localdevice text transcripts over dialogue text present in only a minority oflocal device text transcripts. To provide a more specific example ofmerging module 304 using a majority consensus mechanism, in anembodiment in which merging module 304 accesses three other local devicetext transcripts stored in local device text transcript data 310, andtwo of these local device text transcripts contain the sentence “I tiedthe knot” at a particular chronological point in the transcript whereasthe third local device text transcript contains the sentence “I tied thenot,” at the same chronological point in the transcript, merging module304 selects “I tied the knot” and not “I tied the not” to include in themaster text transcript. Although particular consensus mechanisms aredescribed herein, embodiments contemplate merging module 304 utilizingany consensus mechanism to rectify inconsistencies in the plurality oflocal device text transcripts, according to particular needs.

According to embodiments, and having generated the master texttranscript, merging module 304 stores the master text transcript inmaster text transcript data 312 of cloud system 120 database 124.Merging module 304 may store the master text transcript as a plain textfile, an encrypted file, or in any other format capable of storing atext file.

According to embodiments, transmission module 302 accesses the mastertext transcript stored in master text transcript data 312. Transmissionmodule 302 transmits the master text transcript to the administrationmodules 202 of each of two or more communication devices 110participating in the teleconference space using network 130 andcommunication links 140-142. Each communication device 110administration module 202 stores the master text transcript in mastertext transcript data 220 of communication device 110 memory 114. In anembodiment, transmission module 302 may transmit the master texttranscript to each of other communication devices 110 substantially inreal time; in other embodiments, transmission module 302 may transmitthe master text transcript to each of other communication devices 110once every second, once every five seconds, or at any other frequency.

FIG. 4 illustrates exemplary method 400 of generating an audio-to-texttranscript in a teleconference space comprising multiple communicationdevices 110, according to an embodiment. Method 400 of generating anaudio-to-text transcript in a teleconference space comprising multiplecommunication devices 110 proceeds by one or more actions, whichalthough described in a particular order may be performed in one or morepermutations or alternative orders, according to particular needs. In anembodiment, the actions may comprise: generating a teleconference spaceas action 402, transmitting visual and/or audio data as action 404,generating communication device 110 local device text transcripts asaction 406, transmitting local device text transcripts to cloud system120 as action 408, generating a master text transcript as action 410,and transmitting the master text transcript to communication devices 110as action 412.

At action 402, teleconference system 100 generates a teleconferencespace. In an embodiment, cloud system 120 administrator 122 transmitsrequests, using network 130 and communication links 140-142, to one ormore communication devices 110 to join the teleconference space. Inother embodiments, one or more communication devices transmit requeststo cloud system 120 to generate a teleconference space. Administrationmodule 202 of each communication device 110 may store, in communicationdevices data 210, data that identifies each communication device 110participating in the teleconference space with a unique communicationdevice 110 ID, and each user associated with each computer with aparticular user ID. Communication device 110 administration modules 202may transmit communication devices data 210 to one or more othercommunication devices 110 participating in the teleconference space,and/or to cloud system 120. Communication device 110 administrationmodules 202 may store, in communication devices data 210, communicationdevice 110 ID and user ID data of all other communication devices 110participating in the teleconference space.

At action 404, each communication device 110 transmits visual data andaudio data to one or more other communication devices 110 participatingin the teleconference space, and to cloud system 120, using network 130and communication links 140-142. In other embodiments, eachcommunication device 110 may transmit visual data, audio data, or acombination of visual and audio data, to other communication devices 110participating in the teleconference space. Audiovisual recording module204 of each communication device 110 activates communication device 110audiovisual recording device, and records audiovisual data, comprisingvisual data and audio data, of the user associated with communicationdevice 110. Audiovisual recording module 204 of each communicationdevice 110 stores the visual data of the associated user in local devicevisual data 212 of communication device 110 memory 114, and stores theaudio data of the associated user in local device audio data 214.Administration module 202 of each communication device 110 accesses thevisual data stored in local device visual data 212, and the audio datastored in local device audio data 214, and transmits the visual data andthe audio data to one or more other communication devices 110participating in the teleconference space. Administration module 202 ofeach communication device 110 receives the inbound visual data and theinbound audio data transmitted by the other communication devices 110participating in the teleconference space. Administration module 202 ofeach communication device 110 associates each of the inbound visual dataand audio data transmissions with the particular communication device110 that transmitted each particular visual and audio data transmission,and stores the identification and association information incommunication devices data 210 of communication device 110 memory 114.Administration module 202 of each communication device 110 stores theinbound visual data and audio data in inbound audiovisual data 218. Inan embodiment, graphical user interface module 208 of each communicationdevice 110 accesses local device visual data 212 of communication device110, local device audio data 214 of communication device 110, andinbound audiovisual data 218 of communication device 110, and generatesteleconference view 502, illustrated by FIG. 5 .

At action 406, each communication device 110 generates a local devicetext transcript of the audio data stored in local device audio data 214of communication device 110. Transcription module 206 of eachcommunication device 110 accesses local device audio data 214, andtranscribes the audio data stored in local device audio data 214 into alocal device text transcript. Transcription module 206 of eachcommunication device 110 stores the local device text transcript inlocal device text transcript data 216 of communication device 110 memory114.

At action 408, each of communication devices 110 participating in theteleconference space transmits the local device text transcript to cloudsystem 120. According to embodiments, administration module 202 of eachcommunication device 110 accesses the local device text transcriptstored in local device text transcript data 216 of communication device110. Administration module 202 of each communication device 110transmits the local device text transcript to transmission module 302 ofcloud system 120, using network 130 and communication links 140-142.According to embodiments, administration module 202 may transmit thelocal device text transcript substantially in real time, once everysecond, once every two seconds, or at any other interval of time.Transmission module 302 of cloud system 120 receives local device texttranscripts from each communication device 110 participating in theteleconference space, and stores each local device text transcriptseparately in local device text transcript data 310 of cloud system 120database 124.

At action 410, cloud system 120 merging module 304 accesses the localdevice text transcripts stored in local device text transcript data 310,and merges them into a single master text transcript. Merging module 304stores the master text transcript in master text transcript data 312 ofdatabase 124. According to embodiments, merging module 304 may use amajority consensus mechanism to merge the local device text transcriptsinto the single master text transcript. In other embodiments, mergingmodule 304 may use any consensus mechanism to merge the local devicetext transcripts into the master text transcript, and may perform themerging actions described above substantially in real time, once asecond, once every two second, or at any other interval of time.

At action 412, cloud system 120 transmits the master text transcript toeach communication device 110 participating in the teleconference space.Cloud system 120 transmission module 302 accesses the master texttranscript stored in cloud system 120 master text transcript data 312,and transmits the master text transcript to the administration modules202 of each communication device 110 participating in the teleconferencespace. Transmission module 302 may access and transmit the master texttranscript substantially in real time, once every second, once every twoseconds, or at any other interval of time. Upon receiving the mastertext transcript from transmission module 302, administration module 202of each communication device 110 stores the master text transcript inmaster text transcript data 220 of communication device 110 memory 114.According to embodiments, graphical user interface module 208 of eachcommunication device 110 accesses master text transcript data 220, andgenerates transcript view 602, illustrated by FIG. 6 , whichcommunication device 110 displays on the audiovisual display. Eachcommunication device 110 may display either teleconference view 502 ortranscript view 602 on the audiovisual display, and may permit switchingback and forth between teleconference view 502 and transcript view 602,independently of other communication devices 110 participating in theteleconference space.

Teleconference system 100 terminates method 400 when communicationdevices 110 disconnect from the teleconference space. According toembodiments, cloud system 120 may store one or more master texttranscripts in master text transcript data 312. At a later time, one ormore communication devices 110 may connect to cloud system 120 usingnetwork 130 and communication links 140-142, and may access the one ormore master text transcripts stored in master text transcript data 312.

In order to illustrate the operation of generating an audio-to-texttranscript in a teleconference space comprising multiple communicationdevices 110, an example is now given. In the following example,exemplary teleconference system 100 comprises four communication devices110, cloud system 120, and network 130 and communication links 140-142connecting communication devices 110 and cloud system 120 to oneanother. Although a particular number of communication devices 110,cloud systems 120, networks 130, and communication links 140-142 aredescribed, embodiments contemplate any number of communication devices110, cloud systems 120, networks 130, or communication links 140-142,according to particular needs.

Continuing the example, each of the four exemplary communication devices110 comprises a computer. Each computer comprises an audiovisualrecording device (comprising a camera and microphone), an audiovisualdisplay (comprising an electronic display screen and one or morespeakers), and an input device (comprising a keyboard). In addition, inthis example a single user is associated with each computer; in otherembodiments, any number of users may be associated with any number ofcommunication devices 110, according to particular needs.

Continuing the example, at action 402, communication device 110computers generate a teleconference space. Administration module 202 ofeach computer transmits communication devices data 210, which identifieseach computer with a unique communication device 110 ID, and each userassociated with each computer with a particular user ID, to theadministration modules 202 of the three other computers, and to cloudsystem 120, using network 130 and communication links 140-142.Administration module 202 of each computer stores, in communicationdevices data 210 of the computer, communication device 110 ID and userID of the three other computers participating in the teleconferencespace.

Continuing the example, at action 404, each computer transmits visualdata and audio data to the other computers participating in theteleconference space, and to cloud system 120, using network 130 andcommunication links 140-142. In other embodiments, each participatingcommunication device 110 may transmit visual data, audio data, or acombination of visual and audio data, to other communication devices 110participating in the teleconference space. Audiovisual recording module204 of each computer activates the computer's associated audiovisualrecording device, and records audiovisual data, comprising visual dataand audio data, of the user associated with the computer. Audiovisualrecording module 204 of each computer stores the visual data of the userassociated with the computer in local device visual data 212 of thecomputer memory 114, and stores the audio data of the user associatedwith the computer in local device audio data 214. Administration module202 of each computer accesses the visual data stored in local devicevisual data 212 of the computer, and the audio data stored in localdevice audio data 214 of the computer, and transmits the visual data andthe audio data to each of the other computers participating in theteleconference space.

Continuing the example, administration module 202 of each computerreceives the inbound visual data and the inbound audio data transmittedby the other computers participating in the teleconference space.Administration module 202 of each computer associates each of theinbound visual data and audio data transmissions with the particularcomputer that transmitted each particular visual and audio datatransmission, and stores this identification and association informationin communication devices data 210 of the computer memory 114.Administration module 202 of each computer stores the inbound visualdata and audio data in inbound audiovisual data 218 of the computermemory 114. Graphical user interface module 208 of each computeraccesses local device visual data 212 of the computer, local deviceaudio data 214 of the computer, and inbound audiovisual data 218 of thecomputer, and generates teleconference view 502 that the computerdisplays on the audiovisual display.

FIG. 5 illustrates teleconference view 502, according to an embodiment.Continuing the example, teleconference view 502 displays teleconferencewindow 504 and participant panel 506. Although teleconference window 504and participant panel 506 are shown and described in a particularconfiguration, embodiments contemplate teleconference windows 504 and/orparticipant panels 506 of various configurations and placements,according to particular needs.

Teleconference window 504, occupying a large area of the central portionof teleconference view 502 illustrated in FIG. 5 , displays the visualdata of one or more computers participating in the teleconference space.Although teleconference window 504 is shown and described as occupyingthe center portion of teleconference view 502, embodiments contemplateteleconference view 502 displaying teleconference window 504 in anyconfiguration. In the embodiment illustrated in FIG. 5 and described inthe continuing example, graphical user interface module 208 of eachcomputer accesses local device visual data 212 and inbound audiovisualdata 218, and continuously displays the visual data of all computersparticipating in the teleconference space. In other embodiments,graphical user interface module 208 may select any visual data or anycombination of visual data to display in teleconference window 504 ofteleconference view 502, including, for example, displaying the videodata transmitted by the computer associated with whichever user iscurrently speaking. Although a particular configuration ofteleconference window 504 is shown and described, embodimentscontemplate teleconference views 502 displaying visual components and/orteleconference windows 504 in any configuration, according to particularneeds.

According to embodiments and continuing the example, participant panel506 on the right side of teleconference view 502 illustrated in FIG. 5displays a visual representation of the computers currentlyparticipating in the teleconference space. Although participant panel506 is shown and described as occupying the right side of teleconferenceview 502, embodiments contemplate teleconference view 502 displayingparticipant panel 506 in any configuration. According to embodiments,participant panel 506 may identify teleconference space participants bythe names of the users associated with the other computers, or byidentifying the computers themselves (such as, for example, “Acer One”).In an embodiment, administration module 202 may assign names to othercommunication devices 110 displayed in participant panel 506 usinginformation contained in the communication devices data 210. Continuingthe example, participant panel 506 of exemplary teleconference view 502lists the four participating computers. Although a specific participantpanel 506 configuration is shown and described, embodiments contemplateteleconference views 502 displaying participant panels 506 in anyconfiguration, according to particular needs.

Continuing the example, at action 406, each computer generates a localdevice text transcript of the audio data stored in local device audiodata 214 of the computer. Transcription module 206 of each computeraccesses local device audio data 214 of that computer, and transcribesthe audio data stored in local device audio data 214 into a local devicetext transcript. Transcription module 206 of each computer, havinggenerated a local device transcript of the computer's audio data, storesthe local device text transcript in local device text transcript data216 of the computer memory 114. In this example, transcription module206 of each computer accesses local device audio data 214 of thatcomputer, transcribes the audio data stored therein, and updates thelocal device text transcript stored in local device text transcript data216, once every second. In other embodiments, transcription module 206may update the local device text transcript substantially in real time,once every two seconds, or at any other interval of time.

Continuing the example, at action 408, each of the computersparticipating in the teleconference space transmits the local devicetext transcript to cloud system 120. More specifically, administrationmodule 202 of each computer accesses the local device text transcriptstored in local device text transcript data 216 of the computer.Administration module 202 of each computer transmits the computer'slocal device text transcript to transmission module 302 of cloud system120, using network 130 and communication links 140-142. In this example,administration module 202 of each computer accesses the local devicetext transcript stored in local device text transcript data 216 of thecomputer memory 114, and transmits the local device text transcript tocloud system 120 transmission module 302 once every second. In otherembodiments, administration module 202 may transmit the local devicetext transcript substantially in real time, once every two seconds, orat any other interval of time.

Transmission module 302 of cloud system 120 receives local device texttranscripts from each of the four computers participating in theteleconference space, and stores each local device text transcriptseparately in local device text transcript data 310 of cloud system 120database 124, once every second. In other embodiments, administrationmodule 202 may store local device text transcripts in local device texttranscript data 310 substantially in real time, once every two seconds,or at any other interval of time.

Continuing the example, at action 410, cloud system 120 merging module304 accesses the local device text transcripts stored in local devicetext transcript data 310, and merges them into a single master texttranscript. Merging module 304 stores the master text transcript inmaster text transcript data 312 of database 124. Continuing with theexample, merging module 304 uses a majority consensus mechanism to mergethe local device text transcripts into the single master texttranscript. That is, merging module 304 accesses the local device texttranscripts stored in local device text transcript data 310, merges thelocal device text transcripts into the master text transcript, andstores the master text transcript in master text transcript data 312,once every second. In other embodiments, merging module 304 may use anyconsensus mechanism to merge the local device text transcripts into themaster text transcript, and may perform the merging actions describedabove substantially in real time, once every two second, or at any otherinterval of time.

Continuing the example, at action 412, cloud system 120 transmits themaster text transcript to each of the computers participating in theteleconference space. Cloud system 120 transmission module 302 accessesthe master text transcript stored in cloud system 120 master texttranscript data 312, and transmits the master text transcript to theadministration modules 202 of each of the computers participating in theteleconference space. In this example, transmission module 302 accessesthe master text transcript stored in master text transcript data 312 andtransmits the master text transcript once every second; in otherembodiments, transmission module 302 may access and transmit the mastertext transcript substantially in real time, once every two seconds, orat any other interval of time. Upon receiving the master text transcriptfrom transmission module 302, administration module 202 of each computerstores the master text transcript in master text transcript data 220 ofcomputer memory 114. Graphical user interface module 208 of eachcomputer accesses master text transcript data 220 of the computer, andgenerates transcript view 602 which the computer displays on theaudiovisual display. Each computer may display either teleconferenceview 502 or transcript view 602 on an audiovisual display, and maypermit switching back and forth between teleconference view 502 andtranscript view 602, independently of other computers participating inthe teleconference space.

FIG. 6 illustrates transcript view 602, according to an embodiment. Inan embodiment, transcript view 602 comprises transcript column 604 andparticipant panel 506. Although transcript column 604 and participantpanel 506 are shown and described in a particular configuration,embodiments contemplate transcript columns 604 and/or participant panels506 of various configurations and placements, according to particularneeds.

According to embodiments, transcript column 604 occupies the majority ofthe center of transcript view 602. Although transcript column 604 isshown and described as occupying the center portion of transcript view602, embodiments contemplate transcript view 602 displaying transcriptcolumn 604 in any configuration. Graphical user interface module 208 mayaccess master text transcript data 220, and may display the texttranscription stored in master text transcript data 220 as individualmessages in transcript column 604. In one embodiment, graphical userinterface module 208 may display each sentence stored in master texttranscript data 220 as an individual message. In other embodiments,graphical user interface module 208 may display master text transcriptdata 220 on transcript column 604 in any format, such as, for example,by associating multiple sentences spoken by a particular user into asingle transcript column 604 message.

Continuing with the example, transcript column 604 displays the mastertext transcript as the computers participate in the teleconferencespace. In this embodiment, graphical user interface module 208 accessesthe master text transcript stored in master text transcript data 220,and updates transcript column 604 with the most up-to-date texttranscription stored in master text transcript data 220, once everysecond. In other embodiments, graphical user interface module 208 mayupdate transcript column 604 substantially in real time, once every twoseconds, or at any other interval of time.

According to embodiments, participant panel 506 shown on the right sideof transcript view 602 of FIG. 6 displays a visual representation ofcommunication devices 110 currently participating in the teleconferencespace. Transcript view 602 may identify participating communicationdevices 110 by the names of the users associated with communicationdevices 110 (such as “Jean-Baptiste”), or by identifying communicationdevices 110 themselves (such as “Acer One,” and the like). In anembodiment, administration module 202 may assign names to othercommunication devices 110 displayed in participant panel 506 usinginformation contained in the communication devices data 210. Transcriptview 602 may display the names of communication devices 110participating in the teleconference space under, or in association with,each individual message displayed in transcript view 602. Concluding theexample, teleconference system 100 terminates method 400 when thecomputers disconnect from the teleconference space.

Reference in the foregoing specification to “one embodiment”, “anembodiment”, or “some embodiments” means that a particular feature,structure, or characteristic described in connection with the embodimentis included in at least one embodiment of the invention. The appearancesof the phrase “in one embodiment” in various places in the specificationare not necessarily all referring to the same embodiment.

While the exemplary embodiments have been shown and described, it willbe understood that various changes and modifications to the foregoingembodiments may become apparent to those skilled in the art withoutdeparting from the spirit and scope of the present invention.

What is claimed is:
 1. A system, comprising: two or more communicationdevices, each of the two or more communication devices configured toreceive inbound audio and visual data from at least one othercommunication device and generate a selectable display of ateleconference view or a transcript view; and a computer coupled with adatabase and comprising a processor and memory, the computer configuredto: generate a teleconference space; identify two or more users based oncorresponding communication device data gathered from the two or morecommunication devices participating in the teleconference space;transmit a master audio transcript to each of the two or morecommunication devices; and display, in response to a selection of theselectable display, the transcript view of the master audio transcriptfrom each of the two or more telecommunication devices or theteleconference view of the visual data.
 2. The system of claim 1,wherein the computer is further configured to: separate spoken wordsfrom background noises in the received inbound audio data.
 3. The systemof claim 1, wherein the computer is further configured to: analyze voicepitch or sound frequency to associate each transcribed word with aparticular user associated with one of the two or more communicationdevices.
 4. The system of claim 1, wherein the computer is furtherconfigured to: sort transcribed words into punctuated sentences; andinterpret pauses in spoken words as breaks between transcribedsentences.
 5. The system of claim 1, wherein the computer is furtherconfigured to: associate chronological information with each transcribedword; and store the chronological information in local device texttranscript data.
 6. The system of claim 1, wherein the computer isfurther configured to: transmit the master audio transcript to each ofthe two or more communication devices once every second or once everyfive seconds.
 7. The system of claim 1, wherein the computer is furtherconfigured to: transmit visual data comprising a video file, areal-visual stream or one or more image snapshots.
 8. Acomputer-implemented method, comprising: configuring each of two or morecommunication devices to receive inbound audio and visual data from atleast one other communication device and generate a selectable displayof a transcript view or a teleconference view; generating, using acomputer coupled with a database and comprising a processor and memory,a teleconference space in which the two or more communication devicesparticipate; identifying, by the computer, two or more users based oncorresponding communication device data gathered from the two or morecommunication devices participating in the teleconference space;transmitting a master audio transcript to each of the two or morecommunication devices; and displaying, in response to a selection of theselectable display, the transcript view of the master audio transcriptfrom each of the two or more telecommunication devices or theteleconference view of the visual data.
 9. The computer-implementedmethod of claim 8, further comprising separating, by the computer,spoken words from background noises in the received inbound audio data.10. The computer-implemented method of claim 8, further comprisinganalyzing, by the computer, voice pitch or sound frequency to associateeach transcribed word with a particular user associated with one of thetwo or more communication devices.
 11. The computer-implemented methodof claim 8, further comprising: sorting, by the computer, transcribedwords into punctuated sentences; and interpreting, by the computer,pauses in spoken words as breaks between transcribed sentences.
 12. Thecomputer-implemented method of claim 8, further comprising: associating,by the computer, chronological information with each transcribed word;and storing, by the computer, the chronological information in localdevice text transcript data.
 13. The computer-implemented method ofclaim 8, further comprising: transmitting, by the computer, the masteraudio transcript to each of the two or more communication devices onceevery second or once every five seconds.
 14. The computer-implementedmethod of claim 8, further comprising: transmitting, by the computer,visual data comprising a video file, a real-visual stream or one or moreimage snapshots.
 15. A non-transitory computer-readable storage mediumembodied with software, the software when executed: configures each oftwo or more communication devices to receive inbound audio and visualdata from at least one other communication device and generate aselectable display of a transcript view or a teleconference view;generates, using a computer coupled with a database and comprising aprocessor and memory, a teleconference space in which the two or morecommunication devices participate; identifies, by the computer, two ormore users based on corresponding communication device data gatheredfrom the two or more communication devices participating in theteleconference space; transmits a master audio transcript to each of thetwo or more communication devices; and displays, in response to aselection of the selectable display, the transcript view of the masteraudio transcript from each of the two or more telecommunication devicesor the teleconference view of the visual data.
 16. The non-transitorycomputer-readable storage medium of claim 15, wherein the software whenexecuted further: separates spoken words from background noises in thereceived inbound audio data.
 17. The non-transitory computer-readablestorage medium of claim 15, wherein the software when executed further:analyzes voice pitch or sound frequency to associate each transcribedword with a particular user associated with one of the two or morecommunication devices.
 18. The non-transitory computer-readable storagemedium of claim 15, wherein the software when executed further: sortstranscribed words into punctuated sentences; and interprets pauses inspoken words as breaks between transcribed sentences.
 19. Thenon-transitory computer-readable storage medium of claim 15, wherein thesoftware when executed further: associates chronological informationwith each transcribed word; and stores the chronological information inlocal device text transcript data.
 20. The non-transitorycomputer-readable storage medium of claim 19, wherein the software whenexecuted is further: transmits visual data comprising a video file, areal-visual stream or one or more image snapshots.